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ABSTRACT. The Katz-Sarnak density conjecture states that, in the limit as the analytic conductors 
tend to infinity, the behavior of normalized zeros near the central point of families of L-functions 
agree with the N — > oo scaling limits of eigenvalues near 1 of subgroups of U(N). Evidence for 
this has been found for many families by studying the rt-level densities; for suitably restricted test 
functions the main terms agree with random matrix theory. In particular, all one-parameter families of 
elliptic curves with rank r over Q(T) and the same distribution of signs of functional equations have 
the same universal limiting behavior for their main term. We break this universality and find family 
dependent lower order correction terms in many cases; these lower order terms have applications 
ranging from excess rank to modeling the behavior of zeros near the central point, and depend on the 
arithmetic of the family. We derive an alternate form of the explicit formula for GL(2) L-functions 
which simplifies comparisons, replacing sums over powers of Satake parameters by sums of the 
moments of the Fourier coefficients A/(p). Our formula highlights the differences that we expect to 
exist from families whose Fourier coefficients obey different laws (for example, we expect Sato-Tate 
to hold only for non-CM families of elliptic curves). Further, by the work of Rosen and Silverman we 
expect lower order biases to the Fourier coefficients in one-parameter families of elliptic curves with 
rank over Q(T); these biases can be seen in our expansions. We analyze several families of elliptic 
curves and see different lower order corrections, depending on whether or not the family has complex 
multiplication, a forced torsion point, or non-zero rank over Q(T). 



Assuming the Generalized Riemann Hypothesis (GRH), the non-trivial zeros of any L-function 
have real part equal to 1/2. Initial investigations studied spacing statistics of zeros far from the 
central point, where numerical and theoretical results [Hejj IMonl IQdl [ IOd2l IRS II showed excellent 
agreement with eigenvalues from the Gaussian Unitary Ensemble (GUE). Further agreement was 
found in studying moments of L-functions HCFI 1CFKRS , KeSnl, KeSn2, KeSn3] as well as low- 
lying zeros (zeros near the critical point). 

In this paper we concentrate on low-lying zeros of L(s, f), where / G H£(N), the set of all 
holomorphic cuspidal newforms of weight k and level N. Before stating our results, we briefly 
review some notation and standard facts. Each / e H%(N) has a Fourier expansion 
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Let Xf(n) = af(n)n ^ . These coefficients satisfy multiplicative relations, and |A/(p)| < 2. 
The L-function associated to / is 



oo 



H., /) = E M0 = n ( 1 _^ + ^j , (1 .2) 

n=l P 

where xo is tne principal character with modulus N. We write 

A/(p) = a f (p)+p f (p). (1.3) 
Forp-|iV", ctf(p)Pf(p) = 1 and |a/(p)| = 1. If p|iV we take a/(p) = A/(p) and (3f(p) = 0. Letting 

2 k \ 1/2 f Vn\* (s k-l\ „fs k + 1 



L ~<^> = [si) [— ) T {-2 + — ) r {2 + — ) <L4) 
denote the local factor at infinity, the completed L-function is 

A(s,f) = L^LisJ) = e f A(l -s,f), e f = ±1. (1.5) 

Therefore H%(N) splits into two disjoint subsets, H£(N) = {/ G #£(iV) : e/ = +1} and 
H^(N) = {/ G H£(N) : e/ = —1}. Each L-function has a set of non-trivial zeros p/j = |+fi7/,^. 
The Generalized Riemann Hypothesis asserts that all 7^ G M. 

In studying the behavior of low-lying zeros, the arithmetic and analytic conductors determine the 
appropriate scale. For / G H£(N), the arithmetic conductor Nf is the integer N from the functional 
equation, and the analytic conductor Qf is ^ ( fc+1 ^ fc+3 ) . The number of zeros within C units of the 
central point (where C is any large, absolute constant) is of the order logQ/. For us k will always 
be fixed, so Nf and Q / will differ by a constant. Thus log Qf ~ log iV/, and in the limit as the level 
N tends to infinity, we may use either the analytic or arithmetic conductor to normalize the zeros 
near the central point. See HHal IILSI for more details. 

We rescale the zeros and study jf, /°^ . We let T = U^-jv be a family of L-functions ordered by 
conductor (our first example will be = H%(N)', later we shall consider one-parameter families 
of elliptic curves). The n-level density for the family is 

' JV| f&F N li,...,in V 7 V 7 

where the 0, are even Schwartz test functions whose Fourier transforms have compact support and 
| + 87/£ runs through the non-trivial zeros of L(s, f). As the 0/s are even Schwartz functions, most 
of the contribution to D n ^{<p) arises from the zeros near the central point; thus this statistic is well- 
suited to investigating the low-lying zeros. For some families, it is more convenient to incorporate 
weights (for example, the harmonic weights facilitate applying the Peters son formula to families of 
cuspidal newforms). 

Katz and Sarnak [KaSal , KaSa2J conjectured that, in the limit as the analytic conductors tend to 
infinity, the behavior of the normalized zeros near the central point of a family T of L-functions 
agrees with the iV — > 00 scaling limit of the normalized eigenvalues near 1 of a subgroup of U(N): 

D n ,A<P) = /•••/ 0i (x 1 )---(j) n (x n )W nyG (r)(xi,...,x n )dx 1 ■■■dx n , (1.7) 
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where C7(jF) is the scaling limit of one of the following classical compact groups: N x N unitary, 
symplectic or orthogonal matrices!] Evidence towards this conjecture is provided by analyzing the 
n-level densities of many families, such as all Dirichlet characters, quadratic Dirichlet characters, 
L(s,ip) with ip a character of the ideal class group of the imaginary quadratic field Q(v /— D), 
families of elliptic curves, weight k level N cuspidal newforms, symmetric powers of GL(2) L- 
functions, and certain families of GL(4) and GL(6) L-functions; see llDMTl IFTl iGUl IHRl IHM1 HESl 
IKaSa21lMimiQ^lRRTl lRol imieryo2l . 

Different classical compact groups exhibit a different local behavior of eigenvalues near 1, thus 
breaking the global GUE symmetry. This correspondence allows us, at least conjecturally, to assign 
a definite "symmetry type" to each family of primitive L-functionso 

Now that the main terms have been shown to agree with random matrix theory predictions (at 
least for suitably restricted test functions), it is natural to study the lower order terms Jj In this paper 
we see how various arithmetical properties of families of elliptic curves (complex multiplication, 
torsion groups, and rank) affect the lower order terms. For families of elliptic curves these lower 
order terms have appeared in excess rank investigations [Mil3], and in a later paper [DHKMS] they 
will play a role in explaining the repulsion observed in [Mil4J of the first normalized zero above the 
central point in one-parameter families of elliptic curves. 

We derive an alternate version of the explicit formula for a family T of GL(2) L-functions 
of weight k which is more tractable for such investigations, which immediately yields a useful 
expansion for the 1-level density for a family T of GL(2) cuspidal newforms. We should really 

l Foi test functions supported in (—1, 1), the one-level densities are 



For families of zeta or L-functions of curves or varieties over finite fields, the corresponding classical compact group 
can be determined by the monodromy (or symmetry group) of the family and its scaling limit. No such identification 
is known for number fields, though function field analogues often suggest what the symmetry type should be. See also 
| DM2] for results about the symmetry group of the convolution of families, as well as determining the symmetry group 
of a family by analyzing the second moment of the Satake parameters. 

3 Recently Conrey, Farmer and Zirnbauer [CFZ1, CFZ2| conjectured formulas for the averages over a family of 
ratios of products of shifted L-functions. Their L-functions Ratios Conjecture predicts both the main and lower order 
terms for many problems, ranging from rt-level correlations and densities to mollifiers and moments to vanishing at 
the central point (see IICSI ). In |Mil6, Mil7| we verified the Ratios Conjecture's predictions (up to error terms of size 
0(X _1 / 2+c )!) for the 1-level density of the family of quadratic Dirichlet characters and certain families of cuspidal 
newforms for test functions of suitably small support. Khiem is currently calculating the predictions of the Ratios 
Conjecture for certain families of elliptic curves. 

4 While the main terms for one-parameter families of elliptic curves of rank r over Q(T) and given distribution of 
signs of functional equations all agree with the scaling limit of the same orthogonal group, in [Mill] potential lower 
order corrections were observed (see | FI , RR2 Yo 1 ] for additional examples, and [ Mil3 ] for applications of lower order 
terms to bounding the average order of vanishing at the central point in a family). The problem is that these terms are 
of size 1/logi?, while trivially estimating terms in the explicit formula lead to errors of size log log Rj log R; here 
log R is the average log-conductor of the family. These lower order terms are useful in refining the models of zeros 
near the central point for small conductors. This is similar to modeling high zeros of £(s) at height T with matrices of 
size N = log(T/27r) (and not the N — > oo scaling limits) [KeSnl KeSn2|; in fact, even better agreement is obtained 
by a further adjustment of N arising from an analysis of the lower order terms (see [BBLM DHKMS |). 



/ 4>{u)W\fi{u)du 
J (j)(u)Wi t ijs p (u)du 
J <j)(u)Wi t \j(u)du 



J (p(u)Wi t so( eve n){u)du 
J (j){u)W lt so iodlA )(u)du 



0(u) + ±0(0) 
0(u) + |0(O) 
0(u) + ±0(0) 
£(u) - ±0(0) 



(1.8) 
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write jFjy and Rn below to emphasize that our calculations are being done for a fixed N, and then 
take the limit as N —>■ oo. As there is no danger of confusion, we suppress the N in the and 
R N . ^ 

Let Nf be the level of / 6 T and let be an even Schwartz function such that has compact sup- 
port, say supp(0) C (—a, a). We weight each / G T by non-negative weights w R (f), where log R 
is the weighted average of the logarithms of the levels, and we rescale the zeros near the central 
point by (logi?)/27r (in all our families of interest, logi? ~ logiV). Set Wr(/) = J2f^ w R(f)- 
The 1-level density for the family T with weights %(/) and test function is 

J2 fe:F w R (f)(A(k)+\ogN f ) 



W R {F) \ogR 



0(0) 



E — (/)( ^ ) + log ^0(o ) + W + o fc ^), (1. 



with ^0) = Y\z)/Y(z), A(k) = ip(k/4) + ip((k + 2)/4) - 21og7r, and 



( ' = — — <110) 



The above is a straightforward consequence of the explicit formula, and depends crucially on having 
an Euler product for our L-functions; see HILSH for a proof. As is a Schwartz function, most of 
the contribution is due to the zeros near the central point. The error of size 1/ log 2 R arises from 
simplifying some of the expressions involving the analytic conductors, and could be improved to 
be of size 1/ log 3 R at the cost of additional analysis (see HYolll for details); as we are concerned 
with lower order corrections due to arithmetic differences between the families, the above suffices 
for our purposes. 

The difficult (and interesting) piece in the 1-level density is S^). Our main result is an alternate 
version of the explicit formula for this piece. We first set the notation. For each / 6 T, let 

S(p) = {fef:piN f }. (1.11) 

Thus for / i S{p), a f (p) m + P f (p) m = X f {p) m . Let 



fes(p) ffS( P ) 



we use the convention that 0° = 1; thus Aq^{p) equals the cardinality of S{p). 
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Theorem 1.1 (Expansion for S (J 7 ) in terms of moments of A/(p)). Let \ogR be the average log- 
conductor of a finite family of L -functions T, and let S(fF) be as in (11.101) . We have 



ALAp) logp 7- ( logp 



p m/2 \ og R^\ \ og R 
p m=l v 



m- 



-2my 2Ao ^ logp + 2Y 2Ao ^ p)logp ^(2 

YK ' p(p+l)\osR vlogR Y \ 



logp 

p(p+l)\ogR ^— ' plogR T \ \ogR 

r P 

„l/2 WR^ Wff +Z(/>lUj 



pV2 log/^Vjog^/ p 1 /2( p+ l)2 l QgjR 

_ ? ^2„f(p) logP 7 / logjA 2 ;? rm A 2 ,jr(p)(4p 2 + 3p+ l)logp 
^ plogtf 9 V log^y 1 J ^ p(p + l) 3 lo gj R 

<wm V" V" ^(p)p r/2 (p- l)logp n / _J_\ 
^(^) + S'o(^) + Si OF) + S 2 (F) + S A (.F) + O (t-^—) . (1.13) 



log"/? 



7/" we to 



4f(p) = E W M -T^r? S /= ' (L14) 



/6S(p) 

fry ?/ze geometric series formula we may replace Sa{^F) with S^T), where 

Remark 1.2. For a general one-parameter family of elliptic curves, we are unable to obtain exact, 
closed formulas for the r th moment terms A r ^(p)\ f° r sufficiently nice families we can find exact 
formulas for r < 2 (see HALMl IMiBH for some examples, with applications towards constructing 
families with moderate rank over Q(T) and the excess rank question). Thus we are forced to 
numerically approximate the A r ^(p) terms when r > 30 

We prove Theorem [TTT] by using the geometric series formula for J2 m >3( a f(p) I \fP) m ( an d s i m ~ 
ilarly for the sum involving f3f(p) m ) and properties of the Satake parameters. We find terms like 

1 A/(p) 3 - 3A/(p) _ 1 Xf(p) 2 - 2 

P 3 / 2 P+ 1 - \f{p)sfp~ P 2 P+ 1 - \f(p)y/p' 

While the above formula leads to tractable expressions for computations, the disadvantage is that 
the zeroth, first and second moments of A/(p) are now weighted by l/(p + 1 — A/(p)y^). For 
many families (especially those of elliptic curves) we can calculate the zeroth, first and second 
moments exactly up to errors of size 1/N e ; this is not the case if we introduce these weights in the 



5 This greatly hinders comparison with the L-Functions Ratios Conjecture, which gives useful interpretations for 



the lower order terms. In [CS | the lower order terms are computed for a symplectic family of quadratic Dirichlet L- 
functions. The (conjectured) expansions there show a remarkable relation between the lower order terms and the zeros 
of the Riemann zeta function; for test functions with suitably restricted support, the number theory calculations are 
tractable and in |Mil6 | are shown to agree with the Ratios Conjecture. 
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denominator. We therefore apply the geometric series formula again to expand 1/ (p + 1 — A/ (p) A /p) 
and collect terms. 

An alternate proof involves replacing each af(p) m + (3f(p) m for p E S(p) with a polynomial 
X]^=o c m.r^f(p) m , and then interchanging the order of summation (which requires some work, as 
the resulting sum is only conditionally convergent). The sum over r collapses to a linear combina- 
tion of polylogarithm functions, and the proof is completed by deriving an identity expressing these 
sums as a simple rational function^ 

Remark 1.3. An advantage of the explicit formula in Theorem II .11 is that the answer is expressed 
as a weighted sum of moments of the Fourier coefficients. Often much is known (either theoret- 
ically or conjecturally) for the distribution of the Fourier coefficients, and this formula facilitates 
comparisons with conjectures. In fact, often the r-sum can be collapsed by using the generating 
function for the moments of \f(p). Moreover, there are many situations where the Fourier coef- 
ficients are easier to compute than the Satake parameters; for elliptic curves we find the Fourier 
coefficients by evaluating sums of Legendre symbols, and then pass to the Satake parameters by 
solving oe(p) = 2 v /pcos9 E (p). Thus it is convenient to have the formulas in terms of the Fourier 

coefficients. As Ap(jp) = 0(1 /p)), these sums converge at a reasonable rate, and we can evaluate 
the lower order terms of size 1 / log R to any specified accuracy by simply calculating moments and 
modified moments of the Fourier coefficients at the primes. 

We now summarize the lower order terms for several different families of GL(2) L-functions; 
many other families can be computed through these techniques. The first example is analyzed in 
§[3l the others in §[51 Below we merely state the final answer of the size of the 1 / log R term to a few 
digits accuracy; see the relevant sections for expressions of these constants in terms of prime sums 
with weights depending on the family. For sufficiently small support, the main term in the 1 -level 
density of each family has previously been shown to agree with the three orthogonal groups (we can 
determine which by calculating the 2-level density and splitting by sign); however, the lower order 
terms are different for each family, showing how the arithmetic of the family enters as corrections 
to the main term. For most of our applications we have weight 2 cuspidal newforms, and thus the 
conductor-dependent terms in the lower order terms are the same for all families. Therefore below 
we shall only describe the family-dependent corrections. 

• All holomorphic cusp forms (Theorem 13.41 ): Let be either the family of even weight 
k and prime level iV cuspidal newforms, or just the forms with even (or odd) functional 
equation. Up to 0(log~ 3 R), for test functions <fi with supp(0) C (—4/3, 4/3), as N — > oo 



6 The polylogarithm function is \A s (x) = Y^k=i k~ s x k . If s is a negative integer, say s — — r, then the polylogarithm 
function converges for \x\ < 1 and equals X)j=o( ~/ 0- ~ X Y +1 ^ where the ( r .) are the Eulerian numbers (the 
number of permutations of {1, . . . , r} with j permutation ascents). In jMil5 | we show that if ae,i is the coefficient of 
k l in Y\*Zl(k 2 - j 2 ), and b iti is the coefficient of k l in (2k + 1) YljZl(k ~j)(k + l + j), then for \x\ < 1 and I > 1 



we have 



ai t uti--2l{z) H 1- a^oLio^) 



(2i)\ ar*(l + a;) 



2 (1 - x) 2 ^ 1 



bg,2£+iLi-2e-i(x) H h bi fi Li (x) 




(1.17) 



Another application of this identity is to deduce relations among the Eulerian numbers. 
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the (non-conductor) lower order term is 

- 1.33258 -20(0)/ log R. 



(1.18) 



Note the lower order corrections are independent of the distribution of the signs of the func- 
tional equations. 

CM example, with or without forced torsion (Theorem 15.61) : Consider the one-parameter 



families y z 



+ B(6T + 1) K over 



with B G {1,2,3,6} and k G {1,2}; these 



families have complex multiplication, and thus the distribution of their Fourier coefficients 
does not follow Sato-Tate. We sieve so that (6T + 1) is (6/k) -power free. If k — 1 then all 
values of B have the same behavior, which is very close to what we would get if the average 
of the Fourier coefficients immediately converged to the correct limiting behaviorO If k = 2 
the four values of B have different lower order corrections; in particular, if B = 1 then there 
is a forced torsion point of order three, (0, 6T + 1). Up to errors of size 0(log~ 3 R), the 
(non-conductor) lower order terms are approximately 



B 
B 
B 
B 



1,/c = 1 

1, k = 2 

2, k = 2 

3, k = 2 



B = 6,k 



-2.124 -20(0)/ log R, 
-2.201 -20(0)/ log R, 
-2.347 -20(0)/ log R 
-1.921 -20(0)/ log .R 
-2.042 -20(0)/ log 



(1.19) 



• CM example, with or without rank (see £ 15.21 ): Consider the one-parameter families 
y 2 = x 3 _ S ( 36T + 6)(36T + 5)x over Q(T), with 5 G {1,2}. If B = 1 the family 
has rank 1, while if B = 2 the family has rank 0; in both cases the family has complex 
multiplication. We sieve so that (36T + 6)(36T + 5) is cube-free. The most important 
difference between these two families is the contribution from the S^T) terms, where the 

B = 1 family is approximately — .11 ■ 20(0)/ log R, while the B = 2 family is approxi- 
mately .63 ■ 20(0)/ logi?. This large difference is due to biases of size — r in the Fourier 
coefficients a t (p) in a one-parameter family of rank r over Q(T). Thus, while the main 
term of the average moments of the p th Fourier coefficients are given by the complex mul- 
tiplication analogue of Sato-Tate in the limit, for each p there are lower order correction 
terms which depend on the rank. This is in line with other results. Rosen and Silverman 
URoSill prove J2t modp a *G°) ls related to the negative of the rank of the family over Q(T); 
see Theorem 15 .8| for an exact statement. 

• Non-CM Example (see Theorem 15.141 ): Consider the one-parameter family y 2 = x 3 — 
3x + 12T over Q(T). Up to 0(log~ 3 i?), the (non-conductor) lower order correction is 
approximately 

- 2.703 -20(0) /logi?, (1.20) 

n 

In practice, it is only as p — > oo that the average moments converge to the complex multiplication distribution; 
for finite p the lower order terms to these moments mean that the answer for families of elliptic curves with complex 
multiplication is not the same as what we would obtain by replacing these averages with the moments of the complex 
multiplication distribution. 
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which is very different than the family of weight 2 cuspidal newforms of prime level N. 



Remark 1.4. While the main terms of the 1-level density in these families depend only weakly on 
the family^ we see that the lower order correction terms depend on finer arithmetical properties 
of the family. In particular, we see differences depending on whether or not there is complex 
multiplication, a forced torsion point, or rank. Further, the lower order correction terms are more 
negative for families of elliptic curves with forced additive reduction at 2 and 3 than for all cuspidal 
newforms of prime level N — > oo. This is similar to Young's results HYolL where he considered 
two-parameter families and noticed that the number of primes dividing the conductor is negatively 
correlated to the number of low-lying zeros. A better comparison would perhaps be to square-free 
N with the number of factors tending to infinity, arguing as in HILSII to handle the necessary sieving. 

Remark 1.5. The proof of the Central Limit Theorem provides a useful analogy for our results. If 
X%, . . . , X N are 'nice' independent, identically distributed random variables with mean ji and vari- 
ance a 2 , then as iV — ► oo we have (X 1 + • ■ • + Xn — Nfi) / ayN converges to the standard normal. 
The universality is that, properly normalized, the main term is independent of the initial distribu- 
tion; however, the rate of convergence to the standard normal depends on the higher moments of the 
distribution. We observe a similar phenomenon with the 1-level density. We see universal answers 
(agreeing with random matrix theory) as the conductors tend to infinity in the main terms; how- 
ever, the rate of convergence (the lower order terms) depends on the higher moments of the Fourier 
coefficients. 

The paper is organized as follows. In Owe review the standard explicit formula and then prove 
our alternate version (replacing averages of Satake parameters with averages of the Fourier coef- 
ficients). We analyze all cuspidal newforms in $3] After some preliminary expansions for elliptic 
curve families in §|4l we analyze several one-parameter families in §[51 

2. Explicit Formulas 

2.1. Standard Explicit Formula. Let <p be an even Schwartz test function whose Fourier trans- 
form has compact support, say supp(0) C (— er, a). Let / be a weight k cuspidal newform of level 
N; see (|l.ll) through (11.51) for a review of notation. The explicit formula relates sums of <fi over 
the zeros of A(s, /) to sums of and the Fourier coefficients over prime powers. We have (see for 
example Equations (4.1 1)-(4. 13) of lOXST ) that 




A k ,N((j>) 

hgR 




p m=l 



(2.1) 



All that matters are the first two moments of the Fourier coefficients. All families have the same main term in 
the second moments; the main term in the first moment is just the rank of the family. See |Mil2| for details for one- 
parameter families of elliptic curves 
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where 



2 



A KN (4>) = 20(0) log ( ^ ) + A k,N;j (0) , 



3=1 



Ai,, X:j (<>) = I t • ( aj + ~ + J 0(x)cfe, 



(2.2) 



with $(z) = T'{z)/T{z), = and a 2 = 

In this paper we concentrate on the first order correction terms to the 1-level density. Thus 
we are isolating terms of size 1/ logi?, and ignoring terms that are 0(1/ log 2 R). While a more 
careful analysis (as in HYollO would allow us to analyze these conductor terms up to an error of size 
0(log -3 R), these additional terms are independent of the family and thus not as interesting for our 
purposes. We use (8.363.3) of flGRl (which says ip(a + 68) + ip(a - 66) = 2ip(a) + 0{b 2 /a 2 ) for 
a, 6 real and a > 0), and find 

A KN -M = f W + 7 )+ ( ~ ■ ^,,-2^ • ( 23 ) 

This implies that 



V VK' + ipiog^ 



A fc)JV (0) = 0(0) logiV + 0(0) ( V ( 7 J + V f ^ ) - 21ogvr 



.4/ V 4 

As we shall consider the case of k fixed and N — > oo, the above expansion suffices for our purposes 
and we write 

A KN (<P) = ?(0) logiV + J(0)A(fc) + O fc f — ^- J • (2-5) 

Vlog i2y 

We now average (12.11 ) over all / in our family T. We allow ourselves the flexibility to introduce 
slowly varying non-negative weights wn(f), as well as allowing the levels of the / G T to vary. 
This yields the expansion for the 1-level density for the family, which is given by (11.91) . 

We have freedom to choose the weights wr(/) and the scaling parameter R. For families of 
elliptic curves we often take the weights to be 1 for t E [N, 2N] such that the irreducible polynomial 
factors of the discriminant are square or cube-free, and zero otherwise (equivalently, so that the 
specialization E t yields a global minimal Weierstrass equation); logi? is often the average log- 
conductor (or a close approximation to it). For families of cuspidal newforms of weight k and 
square-free level iV tending to infinity, we might take wn(f) to be the harmonic weights (to simplify 
applying the Petersson formula) and R around k 2 N (i.e., approximately the analytic conductor). 

The interesting piece in (1 1.9b is 
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We rewrite the expansion above in terms of the moments of the Fourier coefficients A/(p). If p\Nf 
then a f {p) m + (3 f {p) m = X f {p) m . Thus 

p\Nj> 



(2.7) 



In the explicit formula we have terms such as cfi(m logp/ log i?). As is an even function, Taylor 
expanding gives 

As we are isolating lower order correction terms of size 1/logi? in 5 , (^ r ), we ignore any term 
which is o(l/ log R). We therefore may replace 4>{m logp/ log R) with epilog p/ log i?) atacostof 
0(1/ log 3 fl) for all m > 30 which yields 

p\N f 

^ 1 / « A/(p) logp - / logp \ 

piNf 

, , 1 \- ^ A/(p) 2 -2 logp 2/ V°gP ' 



p m=3 lv ' /e^ 

P +JV/ 



(2.9) 



We have isolated the m — 1 and 2 terms from pj. iV/ as these can contribute main terms (and not just 
lower order terms). We used for p[Nf that a/(p)+/3/(p) = A/(p) and a/(p) 2 +/3/(p) 2 = A/(p) 2 — 2. 



2.2. The Alternate Explicit Formula. 



As 4> has compact support, the only m that contribute are m <C log i?, and thus we do not need to worry about the 
m-dependence in this approximation because these terms are hit by a p- m / 2 . 
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Proof of Theorem [7771 We use the geometric series formula for the m > 3 terms in (12.91) . We have 

a f {pf 



M 3 (p) := 



m=3 



+ 



(a f ( P y + ft(p) 3 ) ^ - (a f (p) 2 + ^(p) 2 ) 

p(p+ 1 - Xf(p)y/P) 

A / (p) 3 v ^-A / (p) 2 -3A / (p) v ^+2 



p(p + l- Xf(p)^/p) 

(2.10) 

where we use «/(p) 3 + (3f{p) 3 = A/(p) 3 — 3A/(p) and «/(p) 2 + (3f(p) 2 = A/(p) 2 — 2. Writing 

(p+1 — A/(p)y / p) _1 as (p+l) _1 ^1 — Xf ^_f^ j , using the geometric series formula and collecting 
terms, we find 

M 3 (p) - 



p(p + 1) 



Vp(3p + l)A,(p) (p 2 + 3p + l)A/(^ + A p r / 2 (p - l)A,(p)' 



p(p + l) 2 



P(P+ 1; 



r=3 



(P + 1)' 



r+1 



(2.11) 



We use (|2.8I) to replace 0(logp/ logi?) in (|2.9I) with 0(0) + 0(1/ log 2 i?) and the above expansion 
for M 3 (p); the proof is then completed by simple algebra and recalling the definitions of A r ^(p) 
and A' r ^{p), (Q2l) . " □ 

2.3. Formulas for the r > 3 Terms. For many families we either know or conjecture a distribution 
for the (weighted) Fourier coefficients. If this were the case, then we could replace the A r ^{p) with 
the r th moment. In many applications (for example, using the Petersson formula for families of 
cuspidal newforms of fixed weight and square-free level tending to infinity) we know the moments 
up to a negligible correction. 

In all the cases we study, the known or conjectured distribution is even, and the moments have a 
tractable generating function. Thus we may show 



Lemma 2.1. Assume for r > 3 that 



AAp) 



R 



o 



if r = 2£ 
otherwise, 



(2.12) 



and that there is a nice function gu such that 

g M (x) = M 2 x 2 + M 3 x 3 + 



5> 



£ X . 



(2.13) 



t=2 



Then the contribution from the r > 3 terms in Theorem \l.l\ is 
20(0) v-^ / p \ (p — 1) logp 



hgR 



9m 



(p + iy 



p + i 



o 



\oe?R 



(2.14) 
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Proof. The big-Oh term in A r ^(p) yields an error of size l/log 3 i?. The contribution from the 
r > 3 terms in Theorem [Tj] may therefore be written as 

The result now follows by using the generating function gu to evaluate the £-sum. □ 

Remark 2.2. In the above lemma, note that Qm{ x ) has even and odd powers of x, even though the 
known or conjectured distribution is even. This is because the expansion in Theorem 11.11 involves 
p r / 2 , and the only contribution is when r = 21, 

Lemma 2.3. If the distribution of the weighted Fourier coefficients satisfies Sato-Tate (normalized 
to be a semi-circle) with errors in the moments of size 0(1/ log 2 R), then the contribution from the 
r > 3 terms in Theorem \l.l\ is 

— bi7r- +0 fej- (2 - 16) 

where 

\- (2p+ l)logp , lfifW1/l ., n 01 _ 
7st ; J = 2^ p(p+l)3 -4160714430. (2.17) 

//"f/ie Fourier coefficients vanish except for primes congruent to a mod b (where 0(6) = 2) and the 
distribution of the weighted Fourier coefficients for p = a mod b satisfies the analogue of Sato- 
Tate for elliptic curves with complex multiplication, then the contribution from the r > 3 terms in 
Theorem li.il is 

_ 2 7 CM, a , 6 ?(Q) +Q f_4_y (2.18) 
2(3p + l)logp 

7CM,a,b = 2^ ( +1) 3 • ( 2 - 19 ) 

p=a mod 6 

7cmi,3 ~ .38184489, 7c mi,4 « 0.46633061. (2.20) 



In particular, 



Proof. If the distribution of the weighted Fourier coefficients satisfies Sato-Tate (normalized to be 
a semi-circle here), then M% = Ct = ^jr[( 2 /), the t Catalan number. We have (see sequence 
A000108 in (3D) 

g ST ( X ) = 1 ~ VI ~ 4X _ 1 _ x = 2x 2 + + Ux 4 + . . . = Yc £ X e 

2x -f— ' 

1=2 

The value for 7 ST . j was obtained by summing the contributions from the first million primes. 

For curves with complex multiplication, Mg — D(, — 2 ■ | Q ) ; while the actual sequence is just 
( 2 /) = + we prefer to write it this way as the first 2 emphasizes that the contribution is zero 
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for half the primes, and it is |( , ) that is the natural sequences to study. The generating function is 



oo 



<H ■ x i ( ./• ) = 1 y 1 = A x - 2x = 6x 2 + 20x 3 + 70x 4 + • • • = ^D e x e 



t=2 



( P \ _ 2(3p+l) 

these numbers are the convolution of the Catalan numbers and the central binomial (see sequence 
A000984 in ifSUl l The numerical values were obtained by calculating the contribution from the first 
million primes. □ 

Remark 2.4. It is interesting how close the three sums are. Part of this is due to the fact that these 
sums converge rapidly. As the small primes contribute more to these sums, it is not surprising that 

7cmi,4 > 7cmi,3 (the first primes for 7 C mi,4 are 5 and 11, versus 7 and 13 for 7cmi,3)- 

Remark 2.5. When we investigate one-parameter families of elliptic curves over Q(T), it is im- 
plausible to assume that for each p the r th moment agrees with the r th moment of the limiting 
distribution up to negligible terms. This is because there are at most p data points involved in the 
weighted averages A r ^(p); however, it is enlightening to compare the contribution from the r > 3 
terms in these families to the theoretical predictions when we have instantaneous convergence to 
the limiting distribution. 

We conclude by sketching the argument for identifying the presence of the Sato-Tate distribution 
for weight k cuspidal newforms of square-free level iV — > oo. In the expansion of A/(p) r , to first 
order all that often matters is the constant term; by the Petersson formula this is the case for cuspidal 
newforms of weight k and square-free level iV — > oo, though this is not the case for families of 
elliptic curves with complex multiplication. If r is odd then the constant term is zero, and thus to 
first order (in the Petersson formula) these terms do not contribute. For r = 2£ even, the constant 
term is ^ ( 2 /) = = Cg, the & Catalan number. We shall write 

r/2 

W = J2 b ^-2k X f(P r ~ 2k )> (2-23) 

k=0 

and note that if r = 2£ then the constant term is 6 2 £,o — Cg. We have 



fes( P ) 



r/2 r/2 

—— v w R (f)y2Kr-2kXf(p r " 2k ) = yV r _2fcA-.^fc(p), (2-24) 

Wr{J-) — - — ' 



1<LT k=0 k=0 

feS(p) 



where 



AMP) = ~yw~~Fr) E Mf)^^)- (2-25) 

feS( P ) 

We expect the main term to be A 2 e,^-o, which yields the contribution described in (I2.16I ). 
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3. Families of cuspidal newforms 

Let T be a family of cuspidal newforms of weight k and prime level N; perhaps we split by 
sign (the answer is the same, regardless of whether or not we split). We consider the lower order 
correction terms in the limit as iV — ► oo. 

3.1. Weights. Let 

n|iV°° p\N V F 



00 \ J„2 



A/(n 2 ) _ ( N (s)L(sJ®f) 

n=l 



z(sj) = > = " JVV y ; v ; O- 1 ) 



note 



L(s, symV) = — — — — , /) = — — L(l, sym /). (3.2) 

l,n[*s) (,(/) 

To simplify the presentation, we use the harmonic weights^ 

w R U) = (n(2)/Z(1J) = C(2)/L(l,sym 2 /), (3.4) 

and note that 



12 

/6H*(iV) 



(3.5) 



we may take R to be the analytic conductor, so R = 15N/6Air 2 . We have introduced the harmonic 
weights to facilitate applying the Peters son formula to calculate the average moments A r ^(p) from 
studying A r jr. k {p). The Petersson formula (see Corollary 2.10, Equation (2.58) of [ILS]) yields, 
for m, n > 1 relatively prime to the level N, 

E Mmf(m)Xf(n) =S mn + ((Mf ^y ) , (3.6) 
where S mn — 1 if m — n and otherwise. 

3.2. Results. From Theorem ll.U there are five terms to analyze: Sa'{F), So{F), Sx{T), S 2 (3 r ) 
and S A (F). One advantage of our approach (replacing sums of o>f(p) r + (3f(p) r with moments of 
A/(p) r ) is that the Fourier coefficients of a generic cuspidal newform should follow Sato-Tate; the 
Petersson formula easily gives Sato-Tate on average as we vary the forms while letting the level 
tend to infinity, which is all we need here. Thus A T ^(p) is basically the r th moment of the Sato-Tate 
distribution (which, because of our normalizations, is a semi-circle here). The odd moments of the 
semi-circle are zero, and the (2£) th moment is Ci. If we let 



10 The harmonic weights are essentially constant. By llTTllHLII they can fluctuate within the family as 

N- 1 -' < fc uj R {f) « fc A^ 1+e ; (3.3) 
if we allow ineffective constants we can replace N e with log N for N large. 



LOWER ORDER TERMS IN 1 -LEVEL DENSITIES 



15 



then we find 



20(0) 
\ogR 



(3.8) 



and we are writing the correction term as a weighted sum of the expected main term of the moments 
of the Fourier coefficients; see Lemma [231 for another way of writing this correction. These expan- 
sions facilitate comparison with other families where the coefficients do not follow the Sato-Tate 
distribution (such as one-parameter families of elliptic curves with complex multiplication). 

Below we sketch an analysis of the lower order correction terms of size 1 / log R to families of 
cuspidal newforms of weight k and prime level N — > oo. We analyze the five terms in the expansion 

ofS^inTheoremHHJ 

The following lemma is useful for evaluating many of the sums that arise. We approximated 7pnt 
below by using the first million primes (see Remark [331 for an alternate, more accurate expression 
for 7pnt)- The proof is a consequence of the prime number theorem; see Section 8.1 of HYollI for 
details. 



Lemma 3.1. Let 9{t) 
test function, then 

2 \ogp - 



E p <t l°gP an d E(t) = 9(t) — t. If (j) is a compactly support even Schwartz 



logp 
p\ogR T \ \ogR 



where 



7pnt 



0(0) 20(0) 
2 \ogR 

E(t) 



1 + 



t 2 



1 + 



dt 



t 2 



dt 



O 



hg^R 



-1.33258. 



(3.9) 



(3.10) 



Remark 3.2. The constant 7pnt also occurs in the definition of the constants c 4 i and c 4j2 in HYolll , 
which arise from calculating lower order terms in two-parameter families of elliptic curves. The 
constants c^i and are in error, as the value of 7pnt used in HYoll double counted the +1. 

Remark 3.3. Steven Finch has informed us that 7pnt = ~ 7 — J20-°SP) I (P 2 ~ P)'-> see 

http : / / www . research . at t . com/~nj as /sequences /AO 8 334 3 for a high precision 

evaluation and HLan[ IRoScl for proofs. 

Theorem 3.4. Let be supported in (—a, a) for some a < 4/3 and consider the harmonic weights 



Then 



where 



0(0) 



7ST;0 



7ST;2 



w R (f) = C(2)/L(l,sym 2 /). 

2(-7ST;0 + 7ST;2 ~ 7 S T;.A + 7PNt)0(O) 



log ill 



+ 



log 3 R 



(3.11) 



(3.12) 



E 



2 logp 

p pJp+T) 



E 



(4p 2 +3p+l)logp 
p p(p+l) 3 



^ST; 



A 



7pnt 



0.7691106216 
1.1851820642 
0.4160714430 
-1.33258 



(3.13) 
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and 

-7ST;0 +7ST;2 ~ 1 S T;A = ' ( 3 - 14 ) 

The notation above is to emphasize that these coefficients arise from the Sato-Tate distribution. 
The subscript (resp. 2) indicates that this contribution arises from the A ^(p) (resp. A 2i r(p)) 
terms, the subscript A indicates the contribution from (the A r ^(p) terms with r > 3), 

and we use PNT for the final constant to indicate a contribution from applying the Prime Number 
Theorem to evaluate sums of our test function. 

Proof. The proof follows by calculating the contribution of the five pieces in Theorem 11.11 We 
assume is an even Schwartz function such that supp(0) C (—a, a), with a < 4/3, T is the 
family of weight k and prime level N cuspidal newforms (with N — > oo), and we use the harmonic 
weights of §13.11 Straightforward algebra showf] 

(1) Sa>(F)<&N-V 2 . 

(2) s A (^ = _3^^ + (_^ 

functions supported in (—4/3, 4/3) we have Sa^) = — ^J^r^ + ^ (-^~ e )' where 7 ST . A 
w .4160714430 (see Lemmap. 

(3) S (F) = 0(O) + ^W-7s T; oWo) +o (_t_\ ? where7sT;0 = Ep ^siL w 0.7691106216, 

7pnt = 1 + Mjjjl dt « -1.33258. 

(4) « M 2^1 « at!-i i og at. 

(5) Assume a < 4. Then 

o / _ 0(0) 2 7PNT 0(0) 7sT;2 0(0) 
^2(>J o 777^ + 777"5 ' I i _ „3 



2 logi? logi? Vlog^i? 

+ 3p + 
p(p + 1 



7ST;2 = ^ (V + 3p+l)logP „ L1851g20642 (3 15) 



and 7pnt is defined in (13.101) . 
The Sa'{F) piece does not contribute, and the other four pieces contribute multiples of 7st ; o> 
7st;2, 7st;3 and 7PNT- □ 

Remark 3.5. Numerical calculations will never suffice to show that — 7st ; i + 7st ; 2 — 7st-.a * s 
exactly zero; however, we have 

sr( 2 4g 2 + 3p + l (2p+l)(p-l) ^ 1 

-7ST;0 +7ST;2-7ST^ = L ^^+1) + p(p + 1)3 p(p + 1)3 j ^ 

= J^O-logp = 0. (3.16) 

v 

This may also be seen by calculating the lower order terms using a different variant of the explicit 
formula. Instead of expanding in terms of aj(p) m + (3f(p) m we expand in terms of \f(p m ). The 



n 



Except for the Sa (T) piece, where a little care is required; see Appendix lAl for details. 



LOWER ORDER TERMS IN 1 -LEVEL DENSITIES 



17 



terms which depend on the Fourier coefficients are given by 

pf 1 / n A/(p) m logp logp\ ^ logp logp 

p\N m=l UK 1 f<EH*(N) y ta \ 6 / ppV^ 6 \ & 

v^v^ 1 m A/(p m )logp /-/ logp\ 1-/ l0 xlogp 

(3.17) 

this follows from trivially modifying Proposition 2. 1 of HYolH . For N a prime, the Petersson formula 
shows that only the second piece contributes for a < 4/3, and we regain our result that the lower 
order term of size 1 / log R from the Fourier coefficients is just 27 PNT 0(O) / log R. We prefer our 
expanded version as it shows how the moments of the Fourier coefficients at the primes influence 
the correction terms, and will be useful for comparisons with families that either do not satisfy 
Sato-Tate, or do not immediately satisfy Sato-Tate with negligible error for each prime. 

4. Preliminaries for Families of Elliptic Curves 

4.1. Notation. We review some notation and results for elliptic curves; see HKnllSilllSi2M for more 
details. Consider a one-parameter family of elliptic curves over Q(T): 

S: y 2 = x 3 + A(T)x + B(T), A(T), B(T) G Z[T]. (4.1) 

For each t G Z we obtain an elliptic curve E t by specializing T to t. We denote the Fourier 
coefficients by a t (p) = K(p)y/P', by Hasse's bound we have |a t (p)| < or |A t (p)| < 2. The 
discriminant and j -invariant of the elliptic curve E t are 

A(t) = -16(4A(t) 3 + 275(t) 2 ), j(t) = -1728-4A(t) 3 /A(t). (4.2) 

Consider an elliptic curve y 2 = x 3 + Ax + B (with A, B 6 Z) and a prime p > 5. As p > 5, 
the equation is minimal if either p 4 does not divide A or p 6 does not divide B. If the equation is 
minimal at p then 

{ x 3 + A{t)x + B{t)\ , , 

°*(P) = - Z2 { ) = P + l-^Vt(p), (4-3) 

a; mod p ^ 

where N t (p) is the number of points (including infinity) on the reduced curve E mod p. Note that 
a>t+mp{p) = <h{p)- This periodicity is our analogue of the Petersson formula; while it is significantly 
weaker, it will allow us to obtain results for sufficiently small support. 

Let E be an elliptic curve with minimal Weierstrass equation at p, and assume p divides the 
discriminant (so the reduced curve modulo p is singular). Then a^(p) G { — 1, 0, 1}, depending on 
the type of reduction. By changing coordinates we may write the reduced curve as (y — ax) (y — 
fix) = x 3 . If a = (3 then we say E has a cusp and additive (or unstable) reduction atp, and a E (p) = 
0. If a ^ j3 then E has a node and multiplicative (or semi- stable) reduction at p; if a, [3 G Q we 
say E has split reduction and Oe(p) = 1, otherwise it has non-split reduction and Oe(p) = —1. We 
shall see later that many of our arguments are simpler when there is no multiplicative reduction, 
which is true for families with complex multiplication. 

Our arguments below are complicated by the fact that for many p there are t such that y 2 = 
x 3 + A(T)x + B{T) is not minimal at p when we specialize T to t. For the families we study, the 
specialized curve at T = t is minimal at p provided p k (k depends on the family) does not divide a 



18 



STEVEN J. MILLER 



polynomial D{t) (which also depends on the family, and is the product of irreducible polynomial 
factors of A(t)). For example, we shall later study the family with complex multiplication 

y 2 = x s + B(QT + 1) K , (4.4) 

where 5|6°° (i.e., p\B implies p is 2 or 3) and k E {1, 2}). Up to powers of 2 and 3, the discriminant 
is A(T) = (6T + 1) 2k , and note that (<ot + 1, 6) = 1 for all t. Thus for a given t the equation is 
minimal for all primes provided that 6t + 1 is sixth-power free if k — 1 and cube-free if k = 2. In 
this case we would take D[t) — 6t + 1 and = 6/k. To simplify the arguments, we shall sieve our 
families, and rather than taking all t E [N, 2N] instead additionally require that D(t) is k th power 
free. Equivalently, we may take all t E [N, 2N] and set the weights to be zero if D(t) is not k th 
power free. Thus throughout the paper we adopt the following conventions: 

• the family is y 2 = x 3 + A(T)x + B(T) with A(T), B(T) E Z[T], and we specialize T to 
t E [N, 2N] with iV -> oo; 

• we associate polynomials D^T), . . . , D d {T) and integers k±, . . . , k d > 3, and the weights 
are Wu(t) — lift E [N, 2N] and Di(t) is A;/ h power free, and otherwise; 

• log R is the average log-conductor of the family, and log R = (1 + o(l)) log iV (see HDM21 

4.2. Sieving. For ease of notation, we assume that we have a family where D{T) is an irreducible 
polynomial, and thus there is only one power, say k; the more general case proceeds analogously. 
We assume that k > 3 so that certain sums are small (if k < 2 we need to assume either the ABC 
of Square-Free Sieve Conjecture). Let 5 k N d exceed the largest value of \D(t) \ for t E [N, 2N]. We 
say at £ [N, 2N] is good if D(t) is k th power free; otherwise we say t is bad. To determine the 
lower order correction terms we must evaluate 5*(JF), which is defined in (11.101) . We may write 

. 2N 

S(F) = ^-^Y,Mt)S(t). (4.5) 

As Wn(t) = if t is bad, for bad t we have the freedom of defining S{t) in any manner we may 
choose. Thus, even though the expansion for a t (p) in (14.31) requires the elliptic curve E t to be 
minimal at p, we may use this definition for all t. We use inclusion - exclusion to write our sums 
in a more tractable form; the decomposition is standard (see, for example, [Mil2J). Letting t be an 
integer (its size will depend on d and k), we have 

1 2N 

D{t) k — power free 

log^Af 2N 5N d / k 2N 

D(i)=0 mod d fc s O(t)=0 mod d k 

(4.6) 

where /U is the Mobius function. For many families we can show that 

2N / N\ 



t=JV 
-D(t) = mod d fe 
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If this condition^ holds, then applying the Cauchy-Schwarz inequality to (14.61 ) yields 




(4.8) 



For all our families Wr{!F) will be of size iV (see [Mil2] for a proof). Thus for I sufficiently 
large the error term is significantly smaller than 1 / log 3 R, and hence negligible (remember log R = 
(1 + o(l)) log N). Note it is important that k > 3, as otherwise we would have obtained log N to 
a non-negative power (as we would have summed 1/d). For smaller k we may argue by using the 
ABC or Square-Free Sieve Conjectures. 

The advantage of the above decomposition is that the sums are over t in arithmetic progressions, 
and we may exploit the relation a t+mp (p) = a>t{p) to determine the family averages by evaluating 
sums of Legendre symbols. This is our analogue, poor as it may be, to the Petersson formula. 

There is one technicality that arises here which did not in [Mil2]. There the goal was only 
to calculate the main term in the n-level densities; thus "small" primes (p less than a power of 
logiV) could safely be ignored. If we fix a d and consider all t with D{t) =0 mod d k , we ob- 
tain a union of arithmetic progressions, with each progression having step size d k . We would 
like to say that we basically have (N/d k )/p complete sums for each progression, with summands 
a t (p) > a t Q +d k P {p)i a t +2d k P (p)> an d so on. The problem is that if p\d then we do not have a complete 
sum, but rather we have the same term each time! We discuss how to handle this obstruction in the 
next sub-section. 

4.3. Moments of the Fourier Coefficients and the Explicit Formula. Our definitions imply that 
A r ,r{p) is obtained by averaging \ t (p) r over all t E [N, 2N] such that p -{ A(£); the remaining t 
yield A' r r (p). We have sums such as 



In all of our families D{T) will be the product of the irreducible polynomial factors of A(T). For 
ease of exposition, we assume D{T) is given by just one factor. 

We expand S (J 7 ) and S(t) by using Theorem ll.il The sum of S(t) over t with D(t) = mod d k 
breaks up into two types of sums, those where A(t) = mod p and those where A(t) ^0 mod p. 
For a fixed d, the goal is to use the periodicity of the t-sums to replace A r ^{p) with complete sums. 

Thus we need to understand complete sums. If t E [N, 2N], d < \og e N and p is fixed, then the 
set of t such that D(t) =0 mod d k is a union of arithmetic progressions; the number of arithmetic 
progressions equals the number of distinct solutions to D(t) = mod d k , which we denote by 
v D (d k ). We have (N/ d k ) /p complete sums, and at most p summands left over. 




i=JV 
D(t)=0 mod d k 



(4.9) 



Actually, this condition is a little difficult to use in practice. It is easier to first pull out the sum over all primes p 
and then square; see |Mil2] for details. 
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Recall 



A r Ap) = E «*(/)*/(p) r > A 'rAp) = E «*(/)A/(p) p , (4- 



10) 



/e.F 



and set 



= E <pY = p r/2 E A ^) r > ^(p) = E a ^) r - ( 4 - n > 



i mod p 
P+A(t) 



t mod p 
p+A(t) 



t mod p 
p\A(t) 



Lemma 4.1. Let D be a product of irreducible polynomials such that ( i)for all t no two factors are 
divisible by the same prime; ( ii) the same k > 3 ( see the conventions on page 1781) is associated to 
each polynomial factor. For any £ > 7 we have 



KM 



AAp) 



r/2 



p ■ p 



1 vp{p k ) ( x v D {p k ^ 



p ■ p 



r/2 



P K \ p K 

1 , vp{.P k ) L _ vp{p k ) 



o 



o 



\og l ' 2 N 
1 



(4.12) 



Proof For our family, the d > \og e N terms give a negligible contribution. We rewrite A r ^(p) as 

1 



AAp) 



E w 



v ' te[N,2N},p+D(t) 
D(t) k — power free 

log* AT 



log* N 2N 

t^E^) E \t( P y + o (\o g ^ 2 n 



log* N 



t£[N,2N],p+D(t) 
D(t) = mod d k 



v D {d k )N/d k 
P 



E w 



i mod p 
p|D(i) 



log* TV 



log* AT 



v D {d k )N/d k 
P 



E w 



t mod p 
p\D{f) 



(4.13) 



where = 1 if p\d and otherwise. For sufficiently small support the big-Oh term above is 
negligible. As k > 3, we have 



W R (T) 



ivTT 1 



log* JV 



pk 



o 



N 



N y. p(d)u D (d k ) + ^ 



d=l 



log £/2 iV 



(4.14) 
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For the terms with fi(d)8 p u in (14.131) . we may write d as dp, with (d,p) = 1 (the //(d) factor forces 
d to be square-free, so p\ \d). For sufficiently small support, (14.131 ) becomes 



AAp) 



p ■ p 



r/2 



vp{p k 

pk 



vp{p k 

pk 



-1 



+ o(log^ /2 iV) ; 



(4.15) 



this is because 



1 lo |^ fi{d)u D {d k )N 



W R {T) 



d k 



fi(p)MP k )_ l J2 Mu D (d k )N 



d k 



vp{p k ) 

pk 



vp{p k 

pk 



o 



\og l ' 2 N, 



(4.16) 



(the last line follows because of the multiplicativity of v D (see for example [Nag]) and the fact that 
we are missing the factor corresponding to p). The proof for A' r r (p) follows analogously. □ 



We may rewrite the expansion in Theorem ll.il We do not state the most general version possible, 
but rather a variant that will encompass all of our examples. 

Theorem 4.2 (Expansion for S'(JF) for many elliptic curve families). Let y 2 = x 3 + A(T)x + B(T) 
be a family of elliptic curves over Q(T). Let A(T) be the discriminant (and the only primes dividing 
the greatest common divisor of the coefficients of A(T) are 2 or 3), and let D(T) be the product of 
the irreducible polynomial factors of A(T). Assume for all t that no prime simultaneously divides 
two different factors of D{f), that each specialized curve has additive reduction at 2 and 3, and that 
there is a k > 3 such that for p > 5 each specialized curve is minimal provided that D(T) is /c* 
power free (if the equation is a minimal Weierstrass equation for all p > 5 we take k = oo); thus 
we have the same kfor each irreducible polynomial factor of D(T). Let uoid) denote the number 
of solutions to D(t) = mod d. Set Wn(t) = liftE [N, 2N] and D(t) is k th power free, and 
otherwise. Let 

Ar,T(p) = °*(P) P = E A 'rAP) = E ^ 

t mod p t mod p t mod p 

P+A(i) P+A(t) p|A(t) 



p 3 / 2 (p+l-a t (p)) t f^ p p+ 1 - \ t (p)yjp 



t mod p 

p\.A(t) P+A(t) 



pn, . p„ 
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We have 

A m Av)H D ,k{p) hgp 



S{T) = -20(0) E 



pm+l l g R 
p m=l ^ to 



26(0) V 2A oAp) H D,k(p) lQ gP , 2 2A ^(p)H D , k (p) hgp ~ f hgp_ 
^ p 2 (p+l)logi? ^ p 2 lo gj R logi? y 

_ 2 Ax,t{v)HdAv) lo SP T / lQ gP \ , 2 T/ n x -Ai,.F(p)-^D,fc(p)(3p + 1) logp 

2^ r.2 loo- 7? 9 I loo- ff i ™ ' 2-^ n2^Ti 4-1^2 lncr F? 



p 2 hgR \hgRJ ^ p 2 (p+l) 2 \ogR 

f p 

_ 2 MAv) h dAv) lo &P 2 (o lo &p \ , o2({]\ V AHp)#i),fc(p)(V + 3 p + 1 ) 1q sp 

^ p 3 logf2 ^ V logi?; </>l p 3 (p+l)HogR 

-oMrts ST AAp)Hp, k (p)p 3/2 (p - 1) hgp ( _J_ 

w ^ p( P + lyiogR \\og\ 



= S A r(T) + S (^) + S 1 (^) + S 2 (T)+S^) + O{^-^. (4.18) 

If the family only has additive reduction (as is the case for our examples with complex multiplica- 
tion), then the A' m Ap) piece contributes 0. 



Proof. The proof follows by using Lemma 14.11 to simplify Theorem 11.11 and (12.81) to replace the 
6(mhgp/ \ogR) terms with 0(0) + 0(log~ 2 R) in the A' mJ :{p) terms. See Remark [L~2l for com- 
ments on the need to numerically evaluate the A^(p) piece. □ 

For later use, we record a useful variant of Lemma [3TTT 
Lemma 4.3. Let ip be the Euler totient function, and 

a ,b(t) = Y] hgp, E ah {t) = 9 a ,b(t) 777- (4.19) 

it* v(p) 

p— a mod b 

If 6 is a compactly support even Schwartz test function, then 

„v-21ogp-/ logp\ 0(0) 20(0)/ [°° 2E u (t) \ ( 1 



V 

where 



OCj 



t 2 

M 2% 4 (t) 
t 2 



7pnt ; i,3 = 1 + / |f ' dt « -2.375 

7pnt ; i,4 = 1+/ rft « -2.224; (4.21) 



7pnt ; i,3 7pnt ; i,4 were approximated by integrating up to the four millionth prime, 67,867,979. 
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Remark 4.4. Steven Finch has informed us that, similar to Remark 13 .31 using results from ULanl 
IMorll yields formulas for 7pnt ; i,3 an d 7pnt ; i,4 which converge more rapidly: 



7pnt ; i,3 = -2 T -41og27T + log3 + 61ogrQj -2 £ hgl> 
-2.375494 



7j2 _ r,5i, 3 (p) 
p=l,2 mod 3 ^ 



7pnt ; i,4 = -2 7 -31og27r + 41ogrQj -2 ^ log/ ' 



r,2 _ p5i, 4 (p) 
p=l,3 mod 4 ^ ' 

« -2.224837; (4.22) 
here 7 is Euler's constant and Si tTl (p) = 1 if p = 1 mod n and otherwise. 



5. Examples: One-parameter families of elliptic curves over Q(T) 

We calculate the lower order correction terms for several one-parameter families of elliptic curves 
over Q(T), and compare the results to what we would obtain if there was instant convergence (for 
each prime p) to the limiting distribution of the Fourier coefficients. We study families with and 
without complex multiplication, as well as families with forced torsion points or rank. We perform 
the calculations in complete detail for the first family, and merely highlight the changes for the other 
families. 

5.1. CM Example: The family y 2 = x 3 + B(6T + 1) K over Q(T). 

5.1.1. Preliminaries. Consider the following one-parameter family of elliptic curves over Q(T) 
with complex multiplication: 

y 2 = x 3 + B(6T + 1) K , Be {1,2,3,6}, kg {1,2}, k = 6/k. (5.1) 

We obtain slightly different behavior for the lower order correction terms depending on whether 
or not B is a perfect square for all primes congruent to 1 modulo 3. For example, if B = b 2 and 
K — 2, then we have forced a torsion point of order 3 on the elliptic curve over Q(T), namely 
(0, 6(6T + 1)). The advantage of using 6T + 1 instead of T is that (6T + 1,6) = 1, and thus we 
do not need to worry about the troublesome primes 2 and 3 (each a t (p) = for p E {2, 3}). Up to 
powers of 2 and 3 the discriminant is (6T + 1) K ; thus we take D(T) = 6T + 1. For each prime p 
the specialized curve E t is minimal at p provided that p 2k -j, 6i + 1. If p 2k \6t + 1 then w R (t) = 0, 
so we may define the summands any way we wish; it is convenient to use (14.31 ) to define a t (p), 
even though the curve is not minimal at p. In particular, this implies that a t (p) = for any t where 
p 3 \6t + 1. 

One very nice property of our family is that it only has additive reduction; thus if p\D{t) but 
p 2k \ D{t) then a t (p) = 0. As our weights restrict our family to D{t) being k = 6/k power free, we 
always use (14.31) to define a t (p). 

It is easy to evaluate A\^{p) and A 2 ^(p). While these sums are the average first and second 
moments over primes not dividing the discriminant, as a t (p) = for p\A(t) we may extend these 
sums to be over all primes. 
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We use Theorem 14.21 to write the 1 -level density in a tractable manner. Straightforward calcula- 
tion (see Appendix [BJ] for details) shows that 



MAp) 



1 if p > 5 
otherwise 



2p 2 — 2p if p = 1 mod 3 
otherwise. 



(5.2) 



Not surprisingly, neither the zeroth, first or second moments depend on B or on k; this universality 
leads to the common behavior of the main terms in the n-level densities. We shall see dependence 
on the parameters B and k in the higher moments At,f(p), and this will lead to different lower 
order terms for the different families. 

As we are using Theorem 14 . 21 instead of Theorem ll.li each prime sum is weighted by 



H D ,k(p) 



1 + 



vp{p k 

pk 



1 



vp{p k 

P k 



fls?(p)+flgr(p), 



(5.3) 



with H™™ n (p) = 1. Hf) e l e (p) arises from sieving our family to D(t) being (6/k) -power free. We 
shall calculate the contribution of these two pieces separately. We expect the contribution from 
H s dT{p) to be significantly smaller, as each p-sum is decreased by approximately l/p k . 



5.1.2. Contribution from H^ n (p). 

We first calculate the contributions from the four pieces of H^ n (p). We then combine the 
results, and compare to what we would have had if the Fourier coefficients followed the Sato-Tate 
distribution or for each prime immediately perfectly followed the complex multiplication analogue 
of Sato-Tate. 



Lemma 5.1. Let supp(0) C {—a, a). We have 

20(0) • (2 7PNT - 7 gg, - ji% 



S (F) = 0(0) + 



logR 



O 



log 3 i? 



0(N- 



(5.4) 



where 



(>5) 
7CM;0 



£ 

p>5 

2 log 2 



4 logp 



P(P + 1) 

2 log 3 



0.709919 



1.4255554, 



(5.5) 



and 7pnt is defined in Lemma 13.71 

Note 7c^: is almost 27st ; o (see (13.131) ): the difference is that here p > 5. 
Proof. Substituting for AqAp) an d using (12.81) yields 



41ogp | 



20(0) 



2 logp 2 fn l°gP 



p>5 



plogi? \ logi? 



+ 



1 



loef 



(5.6) 
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The first prime sum converges; using the first million primes we find 7^. ~ 0.709919. The 
remaining piece is 

T 21ogp ? / logp \ _ 20(0) /21og2 21og3\ /^_\ 

The claim now follows from the definition of 733 and using Lemma [3TT1 to evaluate the remaining 
sum. □ 

Lemma 5.2. Let supp (0) C (— a, a) and 

7&&= E 2(5p2 l 2P + n 1 3 ) ' 0gP - 0-6412881898. (5.8) 

p=l mod 3 ^ * 

Then 

= _M + g0M-^O + o ( 1 ) + 0( ^i), (5.9) 
2 logi? V log -Ry 

where 7pnt ; i,3 = —2.375494 (see Lemma \4J[ for its definition). 

Proof. Substituting our formula for A 2 ^(p) and collecting the pieces yields 

q / -r\ 9 V- 2Jogp~/ logp\ 2?(0) 2(5p 2 + 2p+l)logp 

pslmod3 to \ to / to p=l mod 3 y 

The first sum is evaluated by Lemma 14.31 The second sum converges, and was approximated by 
taking the first four million primes. □ 

Lemma 5.3. For the families !Fb,k- y 2 = x 3 + B(6T + 1) K with B e {1, 2, 3, 6} and k e {1, 2}, 
we /zave ^(^) = -^AbA )/ lo % R + °( lo S~ 3 R )> where 

InS.i,, « -3437 



^/ ( - 1 ' 3 ^- ps 4203 

7 CM;A;1,2 ^ ZU,;) 

7 (1 ' 3) - ps .5670 

'CM;A;2,2 

S;3, 2 « " 1413 

tSJL, « - 26 20; (5-11) 



f/ze error w a? mcst .0367. 



Proof. As the sum converges, we have written a program in C (using PARI as a library) to approxi- 
mate the answer. We used all primes p < 48611 (the first 5000 primes), which gives us an error of 
at most about ■ p+1 ^ 2 ^ ~ .0367. The error should be significantly less, as this is assuming no 
oscillation. We also expect to gain a factor of 1/2 as half the primes have zero contribution. □ 

Remark 5.4. When k — 1 a simple change of variables shows that all four values of B lead to the 
same behavior. The case of k = 2 is more interesting. If k = 2 and B — 1, then we have the torsion 
point (0, 6T + 1) on the elliptic surface. If B e {2, 3, 6} and (-) = 1 then (0, 6* + 1 mod p) is on 

the curve E t mod p, while if (— ) = — 1 then (0, 6t + 1 mod p) is not on the reduced curve. 
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5.1.3. Contribution from H^ e J. e (p). 



Lemma 5.5. Notation as in Lemma 15.31 the contributions from the Hff™ (p) sieved terms to the 
lower order corrections are 



2 (7cm , sieve;012 : 7cm, sieve;_B,K 

)0(O) 



log-R 

/CM, sieve;012 

(1,3) 
7CM, sieve; 1,1 

(1,3) 
7cM, sieve;l,2 

(1,3) 
7cM, sieved;2,2 

(1,3) 
7CM, sieve;3,2 

(1,3) 
7CM, sieve;6,2 



+ 



lo^R 



(5.12) 



.004288 
.000446 
.000699 
.000761 
.000125 
.000199, 



(5.13) 



where the errors in the constants are at most 10 15 (we are displaying fewer digits than we could!). 

Proof. The presence of the additional factor of 1/p 3 ensures that we have very rapid convergence. 
The contribution from the r > 3 terms was calculated at the same time as the contribution in Lemma 
15.31 and is denoted by Tciviiieve-B n- The other terms (r G {0, 1, 2}) were computed in analogous 

fl 3) 

manners as before, and grouped together into 7cm, S ieve-oi2- 
5.1.4. Results. We have shown 



□ 



Theorem 5.6. For a < 2/3, the H^^ip) terms contribute 0(O)/2 to the main term. The lower 
order correction from the H^ n (p) and Hffjfij)) terms is 

o2(f\\ Ov _, —,(1,3) (1,3) (1,3) (1,3) x 

2(p{U) • (^7p NT - 7 C M;0 _ 7 2 ,3 " 7PNT;1,3 + 7 C M;2 _ ^CM;A,B,K ~ 7 CM, S ieve;012 ~ 7 C M, S ieve;B,J 



logR 



+ 



log d i? 



(5.14) 



Using the numerical values of our constants for the five choices of(B, k) gives, up to errors of size 
0(log~ 3 R), lower order terms of approximately 



B 
B 
B 
B 
B 



1, K 

1, K 

2, K 

3, K 
6, K 



-2.124 -20(0)/ log R, 
-2.201-20(0)/ log R, 
-2.347 -20(0)/ log R 
-1.921 ■ 20(0)/ log R 
-2.042 -20(0)/ log R. 



These should be contrasted to the family of cuspidal newforms, whose correction term was 

20(0) ooocro 20(0) 



7pnt • 



logi? 



-1.33258 



logiT 



(5.15) 



(5.16) 
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Remark 5.7. The most interesting piece in the lower order terms is from the weighted moment 
sums with r > 3 (see Lemma 1531) ; note the contribution from the sieving is significantly smaller 
(see Lemma [531) . As each curve in the family has complex multiplication, we expect the limiting 
distribution of the Fourier coefficients to differ from Sato-Tate; however, the coefficients satisfy 
a related distribution (it is uniform if we consider the related curve over the quadratic field; see 
HMurlO . This distribution is even, and the even moments are: 2, 6, 20, 70, 252 and so on. In 
general, the 2£ th moment is D e = 2 • ~ ( 2 ^) (the factor of 2 is because the coefficients vanish for 
p = 2 mod 3, so those congruent to 2 modulo 3 contribute double); note the 2£ th moment of the 
Sato-Tate distribution is Ce = j^j Q ) . The generating function is 

1 — / l — zL °° 
gcM(x) = / = X - 2x = Qx 2 + 20x 3 + 12Qx 4 + ■ ■ ■ = V D t x £ (5.17) 

(see sequence A000984 in [JSTJ ). The contribution from the r > 3 terms is 

20(0) ^ ( p -i)iogp - / p y 

to p=lmod3 y 1=2 vv ^ 1 7 

Using the generating function, we see that the £-sum is just 2(3p + l)/(p — l)(p + l) 2 , so the 
contribution is 

2?(0) v 2(3p+l)logp _ 2 7cm 3 ;A ?(°) 

logi? ^ (p+i) 3 log/? ' 1 ' ) 

to p=l mod 3 V ^ ' to 

where taking the first million primes yields 

7cm;! ~ -38184489. (5.20) 

It is interesting to compare the expected contribution from the Complex Multiplication distribution 
(for the moments r > 3) and that from the Sato-Tate distribution (for the moments r > 3). The 
contribution from the Sato-Tate, in this case, was shown in Lemma [231 to be 

2 7sT . r 0(0) 

Sa,o(F) = , D , 7st « 0.4160714430. (5.21) 

log it 

Note how close this is to .38184489, the contribution from the Complex Multiplication distribution. 

5.2. CM Example: The family?/ 2 = x 3 - J B(36T + 6)(36T + 5)xoverQ(T). The analysis of this 
family proceeds almost identically to the analysis for the families y 2 = x 3 + B(6T+ 1) K over Q(T), 
with trivial modifications because D{T) has two factors; note no prime can simultaneously divide 
both factors, and each factor is of degree 1. The main difference is that now a t (p) = whenever 
p = 3 mod 4 (as is seen by sending x — > —x). We therefore content ourselves with summarizing 
the main new feature. 

There are two interesting cases. If B = 1 then the family has rank 1 over Q(T) (see Lemma 
IB.5D ; note in this case that we have the point (36T + 6, 36T + 6). If B = 2 then the family 
has rank over Q(T). This follows by trivially modifying the proof in Lemma [B31 resulting in 
•A.\,r{p) — ~ 2p(f) if P = 1 m °d 4 and otherwise (which averages to by Dirichlet's Theorem 
for primes in arithmetic progressions). 

As with the previous family, the most interesting pieces are the lower order correction terms from 
Sr(J^), namely the pieces from H^ n (p) and H^ e (p) (as we must sieve). We record the results 
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from numerical calculations using the first 10,000 primes. We write the main term as 1^^ B (the 

(1, 4) denotes that there is only a contribution from p = 1 mod 4) and the sieve term as 7cm lieve-s- 
We find that 



^CMAl ~ - - 1109 ^CMiieve;! ~ "-0003 

^.4,2 ~ - 6279 ^CMiieve^ ~ -0013. 



(5.22) 



What is fascinating here is that, when B = 1, the value of 1qm-ab * s S1 g n ifi can tly lower than 
what we would predict for a family with complex multiplication. A natural explanation for this is 
that the distribution corresponding to Sato-Tate for curves with complex multiplication cannot be 
the full story (even in the limit) for a family with rank. Rosen and Silverman [RoSiJ prove 

Theorem 5.8 (Rosen-Silverman). Assume Tate's conjecture holds for a one-parameter family £ of 
elliptic curves y 2 = x 3 + A(T)x + B(T) over Q(T) (Tate's conjecture is known to hold for rational 
surfaces). Let A £ (p) = 1 £ t modp a t (p). Then 

lim — V -A £ (p) logp = rank£(Q(T)). (5.23) 

p<X 

Thus if the elliptic curves have positive rank, there is a slight bias among the a t (p) to be negative. 
For a fixed prime p the bias is roughly of size — r for each a t (p), where r is the rank over Q(T) and 
each a t (p) is of size ^Jp. While in the limit as p — > oo the ratio of the bias to a t (p) tends to zero, it 

is the small primes that contribute most to the lower order terms. As 7^ 1,4 ' ) r arises from weighted 

sums of a t (p) 3 , we expect this term to be smaller for curves with rank; this is born out beautifully 
by our data (see (15.221) ). 

5.3. Non-CM Example: The family y 2 = x 3 - 3x + 12T over Q(T). We consider the family 
y 2 = x 3 — 3x + 12T over Q(T); note this family does not have complex multiplication. For all t the 
above is a global minimal Weierstrass equation, and a t (2) = a t (3) = 0. Straightforward calculation 
(see Appendix IB . 3 1 for details) shows that 



-2 ifp>5 

otherwise 

© + (f ) if P > 5 

otherwise 



2p-2-p(— ) ifp>5 

A 2 AP) = { n KpJ ~. (5-24) 

otherwise. 

Unlike our families with complex multiplication (which only had additive reduction), here we 
have multiplicative reductiorEl and must calculate A' m jr(p). We have 

ifp = 2,3 

A^ T {P) = { 2 if mis even (5.25) 

© + (f) if mis odd; 



13 



As we have multiplicative reduction, for each t as p — > oo the a t {p) satisfy Sato-Tate; see MCHT1 Tay ]. 
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this follows (see Appendix IB .3 1) from the fact that for a given p there are only two t modulo p such 
that p\A(t), and one has a t (p) = (-) and the other has a t (p) = (— ) . 

We sketch the evaluations of the terms from (14.181) of Theorem 14.21 for this family, note that 
Ho,k(p) = 1- We constantly use the results from Appendix IB .31 



Lemma 5.9. We have S A ,(F) = -2 7 _2 } 0(O)/ \ogR + 0(log" 3 R), where 



( 



(3) 



£ bg^ + 



p>5 



p J — p 



E 



p>5 
p=l mod 12 



logp 
p 2 — 1 



E 

= 5 mod 12 



logp 



\ 



p 



1 



-0.082971426. (5.26) 



/ 



© m +(?) m 

and using the geometric series formula. 
Lemma 5.10. We have 



Proof. As A! m jr(p) = (-) + (— ) , the result follows by separately evaluating m even and odd, 

□ 



2<K0)-( 7 J 3) +7^-27pnt) 



S {F) = 0(0) 



(i) 



where 



(3) 

7o 



E 



logR 
(4p — 2) logp 



+ 



loef i2 



p 2 (p + 1) 

7pnt is defined in Lemma \3A\ and 7^3 is defined in Lemma \5J\ 



0.331539448, 



(5.27) 



(5.28) 



p>5 
,(!) 



Proof. For p > 5 we have Ad^O 5 ) = p — 2. The 7 o 3 ' ) term comes from collecting the pieces whose 
prime sum converges for any bounded (and replacing 0(2 logp/ log R) with 0(0) at a cost of 
0(log~ 2 i?)), while the remaining pieces come from using Lemma I3TT1 to evaluate the prime sum 
which converges due to the compact support of 0. □ 

Lemma 5.11. We have S^J 7 ) = -2 7 f } 0(O)/ log-R + 0(log" 3 R), where 



7i 



(3) 



E 

p>5 



=3 

V 



(p ~ 1) logP 
p 2 (p + l) 2 



-0.013643784. 



(5.29) 



Proof. As the prime sums decay like 1/p 2 , we may replace 0(logp/ logi?) with 0(0) at a cost of 

■= ; ' ' ' © + (? 



0(log R). The claim follows from Ai^(p) = (-) + (— ) and simple algebra. 
Lemma 5.12. We have 



□ 



0(0) 20(0) -( 7 f -ItS+Tpnt) 



logi? 



+ 



1 



log'R 



(5.30) 



where 



(3) 
72 



(2 - (f )) p 4 - (13 + 7)(^)p 3 - (25 + 6(^))p 2 - (16 + 2(f ))p - 4) logp 



E — 

.085627. 



p 3 (p + ly 



(5.31) 
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Proof. For p > 5 we have Aq^(p) = p 2 — 2p — 2 — {—)p- The term comes from collecting the 

pieces whose prime sum converges for any bounded 6 (and replacing 0(21ogp/ \ogR) with 0(0) 
at a cost of 0(log~ 2 R)), while the remaining pieces come from using Lemma [3TTJ to evaluate the 
prime sum which converges due to the compact support of 0. □ 

Lemma 5.13. We have S A (F) = -2^~ ] ^(0)/ logR + 0(log" 3 R), where 

7^ « .3369. (5.32) 
Proof. As the series converges, this follows by direct evaluation. □ 
We have shown 

Theorem 5.14. The So(F) and S2{J r ) terms contribute 6(0) /2 to the main term. The lower order 
correction terms are 

V 7 ' O ( ) ; (5.33) 



hgR Vog'R 
using the calculated and computed values of these constants gives 



_ M 3.* + f ' I. (5.34) 



logi? \log 6 R 

Our result should be contrasted to the family of cuspidal newforms, where the correction term 
was of size 

26(0) 26(0) 
7pnt ■ ~ -1.33258 ■ (5.35) 

log R log R 

Remark 5.15. It is not surprising that our family of elliptic curves has a different lower order 
correction than the family of cuspidal newforms. This is due, in large part, to the fact that we do not 
have immediate convergence to the Sato-Tate distribution for the coefficients. This is exasperated 
by the fact that most of the contribution to the lower order corrections comes from the small primes. 

Appendix A. Evaluation of S a (F) for the family of cuspidal newforms 
Lemma A.l. Notation as in $3\ we have 



SAr) = _ W(0) + Q ( l \ +o (W\ +0 log R 



\ogR \R u log 2 Rj \N- 73 J V N 



In particular, for test functions supported in (—4/3, 4/3) we have 



(A.36) 



2 7sT . j 0(0) , . 
SaW = T o — + ( R ~ e ) » (A. 37) 

log it v ' 



where 7 ST .j w .4160714430 {see Lemma\2J\l. 
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Proof. Recall 



<? (t\ - <?M(\\ sr A rAp)p r/2 (p - 1) ipgp 

S A {T) - -20(0)^ {v + l) r +H o g R ■ 



p r=3 



(A.38) 



Using |A r jr(p)| < 2 r , we may easily bound the contribution from r large, say r > 1 + 21ogi?. 
These terms contribute 



< 



E E 

p r=l+21ogi? 



2 r p r / 2 [p — 1) logp 
(p + l) r+1 log/? 



— ^logp ^ 



logi? 



r=l+21og_R 



2yf 
p+ 1 



21ogR 



logi? 



2007 



2^2 



21og_R 



E 

p>2008 



logp 



(21o gj R)/3 



P 



1 



# 77 logiT 



note it is essential that 2y/2/3 < 1. Thus it suffices to study r < 2 log i?. 



(A.39) 



p r=3 fc=0 



20(Q) (p-i)iogp ^^ / 



p 



logi? 

20(0) 



p + 

Og_R 

E E E <v- 



p * £=2 

21ogi? r/2 



+ 



2fc" 



(P+1)V \R 77 \ogR 



logi?^^ f-f ""' "* (p+l) r+1 



(A.40) 



In Lemma [23] we handled the first p and £-sum when we summed over all £ > 2; however, the 
contribution from £ > log R is bounded by (8/9) logjR < R~ n . Thus 



S A {F) 



27ST;3 0(0) 



+ 



1 



\ogR ' \R u \ogR / 

21o gj R(r-2)/2 /2 

^r,^;fc(p)p r/2 (p - 1) logp 



20(0) 
logi? 



EE E * 

p r=3 fc=0 



r- 2fe" 



(p + i; 



r+1 



(A.41) 



To finish the analysis we must study the fe^r-^fcAy^fcCp) terms. Trivial estimation suffices for all 
r when p > 13; in fact, bounding these terms for small primes is what necessitated our restricting 
to r < 2 log R. From (13.61 ) (the Petersson formula with harmonic weights) we find 



(A.42) 



p (r-2k)/± log f p (r-2k)/4 N \ rP r ^\og(pN) 
A r ,F;k\Jp) < TT77TTT < 



N 
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As I Si=o Kr-2k\ < 2 r , we have 



27 ST; .4 0(O) 
logR 



+ 



R u \ogR 



O 



1 ^ 2 ^ %2 r p 3r / 4 log(piV) ' 
^ (p + l) r logR 

p r=3 ' 



. (A.43) 



As our Schwartz test functions restrict p to be at most R 7 , the second error term is bounded by 



AHog. 

logR 

N 

logR 

N 



E E 

.p<2007 r=3 



21ogfl /n _3/ 4 



r=3 

2p 3 



21o S i? 



2007 



p+ 1 

2 . 33/4 \ 2l °g R 



E E 

p>2008 r=3 



2p 



3/4 



p + 1 

2p 3/4 



(—J logi?+ £ 



iV^log 2 ^ log^R 



+ T Ef- 1/4 « 



p>2008 

log 2 /? A^ 3cr / 4 log 



p=2011 



N 



73 



AT 



which is negligible provided that a < 4/3. 



(A.44) 

□ 



Appendix B. Evaluation of A r ^ for families of elliptic curves 

The following standard result allows us to evaluate the second moment of many one-parameter 
families of elliptic curves over Q (see [ALM, BEW] for a proof). 

Lemma B.l (Quadratic Legendre Sums). Assume a and b are not both zero mod p and p > 2. Then 

^(al±bt±c\ = f(p-l)© ifp|6 2 -4ac 
££ V V ) \-(f) otherwise. 

B.l. The family y 2 = x 3 + B(6T + 1) K over Q(T). 

In the arguments below, we constantly use the fact that if p\A(t) then a t (p) = 0. This allows us 
to ignore the p^{. A(t) conditions. We assume B G {1, 2, 3, 6} and k e {1, 2}. 

Lemma B.2. We have 

I V — 1 if p > 5 

A^P = i n a " • (B - 2) 

1 otherwise. 

Proof. We have Aq,f(p) = if p = 2 or 3 because, in these cases, there are no t such that A(t). 
If p > 5 then p -j. A(t) is equivalent to p -[ _B(6t + 1) mod p. As 6 is invertible mod p, as t ranges 
over Z/pZ there is exactly one value such that B(6t + 1) = mod p, and the claim follows. □ 

Lemma B.3. We have Ai^(p) = 0. 
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Proof. The claim is immediate for p = 2,3orp = 2 mod 3; it is also clear when k = 1. Thus we 
assume below that p = 1 mod 3 and k = 2: 



t mod p 

= E E p±M--) = E E (^). (B .3, 

t mod p x mod p t mod p x mod p 

The x = term gives (— ) (p — 1), and the remaining p— 1 values of x each give — (— ) by Lemm aTJTTl 
Therefore A\^(p) = 0. □ 

Lemma B.4. We have A2,rip) = 2p 2 — 2p ifp = 1 mod 3, and otherwise. 

Proof. The claim is immediate for p = 2, 3 or p = 2 mod 3. We do the proof for the harder case of 
k = 2; the result is the same when k = 1 and follows similarly. For p = 1 mod 3: 

ahp) = E = E E E ( ^ — Mr — p- — L ) 

t mod p t mod p x mod p 1/ mod p ' 

^ ^ f^±Bf\ fy 3 + 5t 2 



p i \ p 

t mod p a; mod p y mod p 

V + Bt 2 \ fy 3 + Bt : 



EE E 



X P J \ V 

t=l x{p) y mod p 

y> J2 J2 ^ f±x^±_B\ fty 3 + 5 

4=1 x mod p 1/ mod p 

E E E (^){^)-AjY ^> 



V J \ V J \ V 
tx 3 + B \ fty 3 + B\ 2 fB 2 



x mod p ?/ mod p t mod p 

> p ) ( p 



We use inclusion / exclusion to reduce to xy ^ 0. If x = 0, the t and ?/-sums give p(— ) (— ). 
If y = 0, the t and x-sums give p(— )(— ). We subtract the doubly counted contribution from 



x — y — 0, which gives p(— ) (— ) . Thus 



p/ \p 

V- /tx 3 + fi\ fty 3 + B\ 



x=l J/=1 t mod p 

By LemmaED the t-sum is (p - 1)(^) if p|5 2 (x 3 - y 3 ) 2 and -(^) otherwise; as B\6°° 
we have p 4- B. As p = 6m + 1, let g be a generator of the multiplicative group Z/pZ. Solving 
g 3a = g 3b yields b = a, a + 2m, or a + 4m, so x 3 = y 3 three times (for x, y ^ mod p). In each 
instance y equals x times a square (1, g 2m , g 4m ). Thus 

p— i p— i p— i p— i 



mm = EEp-EEm+f-p 2 

= (p-l)3p + p-p 2 = 2p 2 -2p. (B.6) 
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□ 



B.2. The family y 2 = x 3 -(36T+6)(36T+5)x over Q(T). In the arguments below, we constantly 
use the fact that if p\A(t) then a t (p) = 0. This allows us to ignore the A(t) conditions. 

Lemma B.5. We have Ao t ^(p) = p — 2 ifp > 3 and otherwise. 

Proof. We have A ^(p) = Oifp = 2 because there are no t such that pj A (t). Ifp > 3thenp|A(t) 
is equivalent to p •{ (36t + 6)(36£ + 5) mod p. As 36 is invertible mod p, as t ranges over Z/pZ 
there are exactly two values such that (36t + 6) (36 + 5) = mod p, and the claim follows. □ 

Lemma B.6. We have Ai t r{p) = —2p ifp = 1 mod A and otherwise. 

Proof. The claim is immediate if p = 2 or p = 3 mod 4. If p = 1 mod 4 then we may replace 
36t + 6 with t in the complete sums, and we find that 

E E e (f) E (b.7) 

t mod p a; mod p x mod p t mod p 

As p = 1 mod 4, —1 is a square, say — 1 = a 2 mod p. Thus (— ) = (-) above. Further by Lemma 
IB. li the t-sum is p — 1 if p divides the discriminant 1 + Ax 2 , and is —1 otherwise. There are always 
exactly two distinct solutions to 1 + Ax 2 = mod p for p = 1 mod 4, and both roots are squares 
modulo p. 

To see this, letting w denote the inverse of w modulo p we find the two solutions are ±2a. As 
(f) = g) and (f) = 1, we have (f) = (f ). Letp = 4n + l. Then g) = (-1)^ 2 -Wa = 
and by Euler's criterion we have 

(fj = a^' 2 = {a 2 ) ip - m = (-l)»modp. (B.8) 

Thus (— ) = 1, and the two roots to 1 + Ax 2 = mod p are both squares. Therefore 

Ai,Ap) = -2p+ (~) = ~ 2 P- (B - 9) 

x mod p 

□ 

Remark B.7. By the results of Rosen and Silverman [RoSi], our family has rank 1 over Q(T); this 
is not surprising as we have forced the point (36T + 6, 36T + 6) to lie on the curve over Q(T). 

Lemma B.8. Let E denote the elliptic curve y 2 = x 3 — x, with a^(p) the corresponding Fourier 
coefficient. We have 

A 2 AP) = i MP " 3) " MP)2 ^ P " 1 4 (B.IO) 
I otherwise. 

Proof. The proof follows by similar calculations as above. □ 
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B.3. The family y 2 = x 3 - 3x + 12T over Q(T). For the family y 2 = x 3 - 3x + 12T, we have 

c 4 (T) = 2 4 • 3 2 
c 6 (T) = 2 7 • 3 4 T 

A(T) = 2 6 -3 3 (6T-l)(6T + l); (B.ll) 

further direct calculation shows that a t {2) = a t (3) = for all t. Thus our equation is a global 
minimal Weierstrass equation, and we need only worry about primes p > 5. Note that Ci(t) and 
A(t) are never divisible by a prime p > 5; thus this family can only have multiplicative reduction 
for primes exceeding 3. 

If p 1 6t — l, replacing x with x +1 (to move the singular point to (0, 0)) gives y 2 —3x 2 = mod p. 
The reduction is split if v^ € F p and non-split otherwise. Thus if p|6t — 1 then a t (p) = (-). A 
similar argument (sending x to x — 1) shows that if p\6t + 1 then a t (p) = A straightforward 
calculation shows 

'3\ f 1 if p = 1, 11 mod 12 /^3\ f 1 ifp=l, 7 mod 12 



(B-12) 

p) 1—1 ifp = 5, 7 mod 12, \p / 1—1 if p = 5, 11 mod 12. 
Lemma B.9. We have Aq^(j>) = p — 2 z/p > 3 ^nJ otherwise. 

Proof. We have -Ao^p) = if p = 2 or 3 by direct computation. As 12 is invertible mod p, as t 
ranges over Z/pZ there are exactly two values such that (6t — l)(6i + 1) = mod p, and the claim 
follows. □ 

Lemma B.10. Ai^{2) = Ai t ^(3) = 0, and for p > 5 we have 

( 2 if p = 1 mod 12 
Aif(p) = (~) + (~) = { if P = 7, 11 mod 12 (B.13) 
^ ^ P ' (-2 ifp = 5modl2. 

Proof. The claim is immediate for p < 3. We have 

Ai,ap) = - Yl a *(p) 

t mod p 
A(t)^0 mod p 

^ fx 3 -3x + 12t\ . fx 3 -3x+ 12 



P / \ V 

t mod p < mod P 

A(t)=0 mod p 



= 0+(-) + P ); (B.14) 



the last line follows from our formulas for a t (p) for p\A(t). □ 
Lemma B.ll. A 2 ^(2) = A 2 ,f(3) = 0, and for p >5we have A 2 ,f(p) = p 2 - 3p - 4 - 2(^). 
Proof. The claim is immediate for p < 3. For p > 5 we have a t (p) 2 = 1 if p\A(t). Thus 

a 2 ,ap) = °^) 2 



t mod p 
A(t)^0 mod p 



s-iyi, mi_n_i p 

_ E E E M)(W_ 2 , (B , 5) 

t mod p x mod p 1/ mod p 
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Sending t — > 12 1 t mod p, we have a quadratic in t with discriminant 

((x 3 - 3x) - {y 3 - 3y)f = (x - yf ■ {y 2 + xy + x 2 - 3) 2 = 6{x,y). 



(B.16) 



We use Lemma |BTI to evaluate the t-sum; it is p — 1 if p\S(x, y), and —1 otherwise. Letting 

r](x, y) — 1 if p\5(x, y) and otherwise, we have 

A 2 Ap) = v{x,y)p-p 2 -2. (B.17) 

x mod p y mod p 

For a fixed a;, y) if y = x or if y 2 + xy + x 2 — 3 = mod p (we must be careful about 

double counting). There are two distinct solutions to the quadratic (in y) if its discriminant 12 — 3a; 2 
is a non-zero square in Z/pZ, one solution (namely — 2~ 1 x, which is not equivalent to x) if it is 
congruent to zero (which happens only when x = ±2 mod p), and no solutions otherwise. If the 
discriminant 12 — 3x 2 is a square, the two solutions are distinct from x provided that x ^ ±1 mod p 
(if x = ±1 mod p then one of the solutions is x and the other is distinct). Thus, for a fixed x, the 
number of y such that p\S(x, y) is 2 + ( 12 ~ 3x2 ) if x ^ ±1, ±2 and 2 if x = ±1, ±2. Therefore 

12 - 3x 2N 



AHp) 



E 



x mod p 
C^±l,±2 mod p 



2 + 



P 



p+ 5^ 2 ' p 

x=±l,±2 mod p 



P 



x mod p 
cc^±l,±2 mod p 



2(p-4)p + p ^ 

mod 
,±2 i 

'12 - 3x 2 



12 - 3x 2 
p 



4 • 2p - p 2 - 2 



P 



2+p ( U 3X2 ) -2p = p 2 -2p-2 ~ p(— ) 

« mod p ^ P VP/ 



(B.18) 



where we used Lemma IbTTI to evaluate the x-sum (as p > 5, p does not divide its discriminant). □ 
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